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(54) Similarity-based document retrieval 

(57) Methods and apparatus are provided for 
accessing an experience journal which includes 
unstructured text items relating to a topic, such as a 
medical condition. The method is implemented in a 
computer system including a processor, a storage 
device, a video display unit having a display screen, and 
a user interface. The unstructured text items are stored 
in the storage device. Similarities among the unstruc- 
tured text Hems are determined, and icons, one corre- 
sponding to each of the unstructured text items, are 
displayed on the display screen. The icons are posi- 
tioned on the display screen relative to each other, such 
that the distances between icons are representative of 
the determined similarities among the unstructured text 
items. In response to user selection of one of the icons, 
the corresponding unstructured text item is displayed on 
the display screen. 
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Description 

field of the Invention 

10001] This Invention relates to methods and appara- 
tus for providing computerized access to an experience 
journal containing unstructured text items relating to a 
topic and, more particularly, to a computerized experi- 
ence journal wherein text items are automatically 
arranged according to similarity and wherein text items 
of interest can be accessed with extreme ease. 

Background of the Invention 

[0002] Medical advances make it increasingly possi- 
ble for children with previously fatal illness to live and 
thrive. However, a significant number still experience 
repeated operations, hospitalizations and invasive pro- 
cedures, or need special care at home. Many do so with 
little or no intervention to help them and their families 
cope with the emotional stresses involved. Serious 
medical illness is, therefore, increasingly recognized as 
an important early risk factor for emotional disturbance. 
[0003] A variety of interventions have been developed 
to assist patients and their families with the hope of 
improving the resiliency of both. These include pre- 
admission hospital preparatory programs, meetings 
between families and physicians, books and videos for 
children of different ages and psychiatric consultation. 
Yet it is clear that the majority of families are not able to 
avail themselves of these resources before coming to 
the hospital. 

[0004] One additional and potentially underutilized 
source of psychological support is the community of 
patients and families who have experienced hospitaliza- 
tion. However, in spite of a general willingness to share 
experiences, communication among patients and fami- 
lies is usually limited. To facilitate this process, the use 
of computer technology to record, organize and display 
stories about the experiences of families with children 
who have been treated for serious illness has been pro- 
posed. Children and their families are asked to record 
text and multimedia vignettes describing some aspect 
of their illness, coping strategies or care that might be 
useful to others. The collection of text and vignettes is 
referred to as an experience journal. 
[0005] A difficulty arises in providing access to the 
items in the experience journal. The text items are 
highly unstructured and differ greatly in style, content 
and sophistication. The items may be prepared by any- 
one from a physician or a parent to a young child. Thus, 
organizing the text items for access is extremely diffi- 
cult. The user group likewise may vary from medical 
professionals to parents to very young children. Thus, 
the text items must be available to users with limited 
computer skills. Because the text items are unstruc- 
tured and differ greatly in content, existing techniques of 
index, classification and search are not useful. Accord- 



ingly, there is a need for methods and apparatus for 
organizing and providing access to an experience jour- 
nal which includes unstructured text items relating to a 
topic. Summary of the Invention 

5 [0006] According to a first aspect of the invention, 
methods and apparatus for providing access to an 
experience journal including unstructured text items 
relating to a topic are provided. The method is imple- 
mented in a computer system comprising a processor, a 

10 storage device, a video display unit having a display 
screen, and a user interface. The computer system exe- 
cutes the steps of storing the unstructured text items in 
the storage device, determining similarity among the 
unstructured text items and displaying icons, one corre- 

75 sponding to each of the unstructured text items on the 
display screen. The icons are positioned on the display 
screen relative to each other such that the distances 
between icons are representative of the determined 
similarities among the unstructured text items. In 

20 response to user selection of one of the icons, the cor- 
responding unstructured text item is displayed on the 
display screen. 

[0007] The similarity among the structured text items 
may be determined by analyzing the unstructured text 

25 items for use or the same words. In particular, stop 
words may be removed from the text items and suffixes 
may be removed from words in the text items. The 
remaining words in the text items are weighted accord- 
ing to the inverse of their occurrence frequency in a ref- 

30 erence text to provide a word vector for each of the text 
items. A similarity score is determined for each pair of 
unstructured text items by determining the dot product 
of the<;orresponding word vectors. 
[0008] The icons are preferably positioned relative to 

35 each other by multidimensional scaling of the deter- 
mined similarities among the unstructured text items. 
[0009] A new text Hem may be entered into the expe- 
rience journal after optional review and editing by a 
screening committee by determining similarity between 

40 the new text item and other text items in the experience 
journal, displaying a new icon corresponding to the new 
text item on the display screen and positioning the new 
icon relative to other icons such that the distances 
between the icons are representative of the determined 

45 similarities among the text items. 

[0010] The experience journal may be implemented 
as a website on the World Wide Web. Alternatively, the 
experience journal may be implemented on a local area 
network, on a single computer or on a combination of 

so the World Wide Web and one or more local area net- 
works. The principal user functions include accessing 
entries in the experience journal and submitting new 
entries for inclusion in the experience journal. 

55 Brief Descript ion of the Drawings 

[0011] For a better understanding of the present 
invention, reference is made to the accompanying draw- 
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ings. which are incorporated herein by reference and in 
which: 

FIG. 1 is a block diagram of a computer system suit- 
able for implementing the present invention; 
FIG. 2 is a block diagram of a computer network 
suitable for implementing the present invention; 
FIG. 3 is a flow chart of the operation of an experi- 
ence journal in accordance with an embodiment of 
the invention; 

FIG. 4 is a flow chart of an example of a process for 
determining similarity between text items shown in 
FIG. 3; 

FIG. 5 is a flow chart of an example of a process for 
determining icon positions shown in FIG. 3; 
FIG. 6 illustrates an example of a welcome display 
window that may be used in the experience journal; 
FIG. 7 illustrates an example of an icon display win- 
dow containing icons that represent text items in 
the experience journal; 

FIG. 8 illustrates an example of a display window for 
entry of a text item into the experience journal; and 
FIG. 9 illustrates an example of a display window for 
control of the experience journal by a screening 
committee. 

Detailed Description 

[0012] FIG. 1 is a block diagram of an example of a 
computer system that may be used in implementing the 
present invention. A central processing unit (CPU) 10 is 
connected to a memory 12 that stores instructions to be 
executed by the CPU 10. The memory 12 can be any 
type of memory, including RAM. ROM, CD ROM, mag- 
netic disk, hard disk, etc. Data relating to execution of 
the instructions may also be stored in the memory 12. 
Alternatively, different memories can be used for the 
instructions and the data. The CPU 10 is also con- 
nected to a video display unit 16 for displaying informa- 
tion to a user. The user can input information to the CPU 
through a user interface including a keyboard 20 and a 
pointing device 22, such as a mouse or a trackball. A 
network interface 24 connected to CPU 10 may be used 
to interface the computer system to a local area network 
(LAN) or a wide area network (WAN) . such as the 
World Wide Web. It will be understood that the compu- 
ter system may have a variety or different components, 
configurations and capabilities within the scope of the 
present invention. 

[001 3] A computer network suitable for implementa- 
tion of the present invention is shown in FIG. 2. Comput- 
ers 30, 32, ... 33 are interconnected on a network 40 
which may be a LAN, the World Wide Web or a combi- 
nation of a LAN and the World Wide Web. One of the 
computers, such as computer 30, may be designated as 
a website for the computerized experience journal of the 
invention. The computer 30 may be used for control of 
the experience journal by a screening committee as 



described below. The remaining computers may access 
text entries in the experience journal through network 
40 and may submit new text items for inclusion in the 
experience journal. 

5 [0014] One purpose of the invention is to provide a 
way for a community to gather, organize and share the 
community's collective wisdom on a given topic or issue 
in an experience journal. In one example, the commu- 
nity includes the families of children who have been 

10 through a cardiac unit at a hospital. Families can con- 
tribute to the experience journal by submitting personal 
narratives about their experiences, helpful information, 
poems and stories, as well as pictures and video about 
the experience of coping with a serious illness. 

75 [001 5] One feature of such personal narratives is that 
they are quite unlike the logically structured and uniform 
text that is found in media such as encyclopedia articles 
and newspaper stories. In general, the personal narra- 
tives may be considered unstructured, anecdotal text. 

20 Much of the accepted wisdom about text-based infor- 
mation retrieval has been developed for more^structured 
text data. For example, in structured text data, research- 
ers have found that specification of an indexing or a 
classification scheme can be useful. However, in an 

25 experience journal, it may be difficult to predict the top- 
ics that contributors will want to address, and restricting 
contributions to a predefined set of topics is an unac- 
ceptable constraint. 

[0016] Another key distinction is that an experience 

30 journal should lend itself to browsing. The type and 
degree of organization needed to support effective 
browsing is very different from that which is needed to 
support effective querying. Therefore, the fundamental 
operation in an experience journal tends not be "find an 

35 entry with the following keywords", so much as "find an 
entry that is related to the one I just read" or "find an 
entry that is very different from the one I just read". 
[0017] An important characteristic of an experience 
journal is that it will grow and evolve continuously over 

40 time, even while people are reading it. For this reason, a 
preset limit on the size of an experience journal is unac- 
ceptable. Furthermore, contributors will hope to see an 
entry included within the experience journal soon after 
writing it. Therefore, a model wherein successive expe- 

45 rience journal versions are produced and released at 
intervals may be undesirable. 
[0018] Contributions to an experience journal will 
come from many individuals in many styles and forms. 
In order to make authoring and submission as easy as 

so possible, minimal assumptions are made about the form 
of an experience journal entry. 
[001 9] Websites that contain experience journal data 
can be organized and maintained by a human webmas- 
ter. However, sufficient funds to pay a professional web- 

55 master may not be available. Accordingly, the 
experience journal should be self-organizing and self- 
evolving, and should be capable of operating indefinitely 
without supervision or intervention by specifically 
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trained people. 

[0020] In addition to being able to browse an experi- 
ence journal within a hospital, participating families 
should be able to access it over the Internet, using a 
variety of different web browsers. Furthermore, users 
with only the basic computer skills should be able to 
browse an experience journal. For this reason, the inter- 
face to the experience journal preferably uses simple 
point and click actions rather than more sophisticated 
access and browsing models, such as Boolean queries 
or tailored databases. 

[0021 ] A flow chart of the operation of the experience 
journal in accordance with the invention is shown in 
FIG. 3. Major functions available to the user include 
submission of text items for inclusion in the experience 
journal and access to text Herns in the experience jour- 
nal. A particular user may be involved with either or both 
functions. 

[0022] A text Hern for entry into the experience journal 
can be a HTML document (created with Netscape Nav- 
igator Gold or Microsoft Front Page, for example), an 
ASCII text file or a URL for either type of file After it is 
composed by a contributing patient or family in step 
100, a text item for the experience journal may be com- 
municated by e-mail to the members of a screening 
committee, which may be composed of members of the 
hospital staff and parents of present and former 
patients. The task of the committee is to review the sub- 
mitted text entry in step 102 to determine whether the 
text item should be entered into the experience journal 
as submitted or whether it should be edited to eliminate 
material that could be medically harmful or legally com- 
promising. The text item may be edited as necessary. 
When committee acceptance is given, the text Hern is 
then translated automatically into HTML format, if nec- 
essary, and is incorporated into the experience journal 
in step 104. 

[0023] The relationship of the new entry to existing 
experience journal entries must be established. In step 
106, the similarity among the text items in the experi- 
ence journal is determined. The similarity between each 
pair of text items is quantified as a similarity score as 
described in detail below in connection with FIG. 4. The 
similarity scores are used in step 108 to determine icon 
positions in a display window. In particular, each text 
item in the experience journal is represented by an icon 
110 in a display window 116, as shown in FIG. 7. The 
icons are positioned in the display window 116 relative 
to each other, such that the distances between the icons 
are representative of the determined similarities among 
the text items. Thus, for example with reference to FIG. 
7, icons 112 and 114 represent text items that have a 
high degree of similarity, whereas icons 110 and 112 
represent text Hems that have a relatively low degree of 
similarity. A preferred technique for determining icon 
positions is described below in connection with FIG. 5. 
After the icon positions have been determined, the icon 
display window 116 is generated in step 120. As indi- 



cated above, the positions of the icons in the display are 
indicative of similarity. Thus, groups of iconscorrespond 
to similar text items in the experience journal. 
[0024] The experience journal is then available for 

s access by users. In step 122, the process waits for user 
selection of an icon, typically by pointing and clicking 
with pointing device 22 (FIG. 1). In step 124, a determi- 
nation is made as to whether a user selection has been 
made. When a user selection has not been made, the 

10 process continues to wait in step 122. When a user 
selection is made, the text item corresponding to the 
selected icon is displayed in step 130. As shown in FIG. 
7, a selected text item 132 is displayed in a text window 
136 in the left side of the display screen. An icon 134 

is corresponding to the text item 132 may be highlighted 
for reference by the user in selecting similar or dissimilar 
text items. 

[0025] A flowchart of an example of a process for 
determining similarity among text items in the experi- 

20 ertce journal is shown in FIG. 4. In step 200, stop words 
are removed from each text entry. Stop words, such as 
"the", "and", "is", etc., are those words which occur with 
such high frequency in the text as to be useless for 
determining similarity in meaning between text items. In 

25 step 202 suffixes, such as "ly", "ing*\ etc.. are stripped 
from the words in the text items. In step 204, the remain- 
ing words in each text item are weighted according to 
the inverse of their occurrence frequency in a large ref- 
erence text. Thus, rarely used words are weighted 

30 heavily, and commonly used words are discounted. The 
weights for each text item are represented as a word 
vector. A similarity score for a pair of text items is deter- 
mined in step 206 as the dot product of the word vectors 
associated with the two text items. The procedure of 

35 FIG. 4 is repeated for each pair of text items in the expe- 
rience journal, thus providing measures of similarity 
among all the text items in the experience journal. Tech- 
niques for determining similarity between texts based 
on similarity scores are described by G. Salton in Auto- 

40 matic Text P rocessing: The Transformation. Analysis 
and Retrieval of Information by Computer. Reading, MA: 
Addison-Wesley, 1989, chapters 8-10. 
[0026] The similarity scores for the pairs of text items 
in the experience journal are used to determine the 

45 positions of the icons in the display window 116. The 
technique of multidimensional scaling may be used to 
position icons so that similar entries in the experience 
journal are located near each other in the display win- 
dow and dissimilar entries are located far apart. Thus. 

so proximity rather than coordinate location is the measure 
of similarity. Clusters of similar entries form naturally. 
Techniques for multidimensional scaling are described 
by I. Borg et al in Modern Mult idimensional Scaling: 
Theory and Applications. Berlin : Springer, 1997 and by 

55 J.B. Kruskal in "Multidimensional Scaling and Other 
Methods for Discovering Structure", Statistical Methods 
for Digital Computers. Englein, Ralston. Wilf (eds.). vol. 
3. Wiley, pages 296-335. 
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[0027] A flow chart or the process for determining icon 
positions on the display screen is shown in FIG. 5. The 
similarity scores, or "distances", are obtained from the 
process of FIG. 4 in step 250. Points corresponding to 
the text items are embedded in a multidimensional 
space in step 252 such that all pairs of points are the 
desired distances from each other. In step 254, the 
points are projected onto a two-dimensional space in 
which these distances are best preserved. The icons 
are displayed in step 256 using their corresponding 
point positions in the two-dimensional space. 
[0028] Given any pairwise distance measures 
between points, it is possible to embed the points in 
some Euclidean space such that the points are the 
required distances from each other. This is due to a the- 
orem by Young and Householder. If this space is two- 
dimensional, then we could use the point coordinates in 
the space as the coordinates for the screen display of 
the icons. However, this is rarely the case; usually the 
Euclidean space in which the points are embedded has 
very high dimensionality. In order to use this embedding 
to inform the display layout it is necessary to project the 
points from the multi-dimensional Euclidean space to a 
two-dimensional Euclidean space. There are of course 
an infinite number of ways to do this projection. We 
would like to find the projection that best preserves the 
distances between points. This projection is found by 
using Principal Component Analysis. The principal 
component is any vector such that the perpendicular 
projections of the points from the multi-dimensional 
space in which they were embedded onto that vector 
have the largest possible variance. The second princi- 
pal-component vector is any vector orthogonal to the 
first, such that the perpendicular projections of the 
points onto it have the largest possible variance. The 
first two principal-component vectors therefore define a 
desired two-dimensional Euclidean space, i.e., one that 
preserves the interpoint distances. In fact, it can be 
proved that the two-dimensional Euclidean space 
defined by the principal-component vectors is the best 
two-dimensional space for preserving these distances. 
Once the points have been projected onto this two- 
dimensional space, the point coordinates can be used 
as the locations of the corresponding icons in the 
screen display. 

[0029] Examples of display screens that may be uti- 
lized in implementing the computerized experience jour- 
nal of the present invention are shown in FIGS. 6-9. An 
example of a welcome display is shown in FIG. 6. A wel- 
come display window 300 includes a pond 302 having 
lilypad icons 304 and a frog icon 306. By pointing and 
clicking on various components of the welcome display 
window 300, different informational messages are given 
to the user. Menu buttons at the top of the display may 
be used for selecting other windows in the experience 
journal. A button 310 is used for selecting the welcome 
display window 300. A button 312 is used for selecting a 
"read" display as shown in FIG. 7. A button 314 is used 



for selecting a "write" display as shown in FIG. 8. 
[0030] The read display, an example of which isshown 
in FIG. 7, is used for accessing and reading text items in 
the experience journal. The icon display window 116 

5 includes icons, such as lily pad icons 110, 112, 114, 
etc., distributed in a background area, such as a pond 
332. As discussed above, each of the icons corre- 
sponds to one of the text items in the experience jour- 
nal. Distances between icons in display window 1 16 are 

10 representative of similarity between the corresponding 
text items. Thus, icons are closely spaced when the cor- 
responding text items are relatively similar, and icons 
are widely spaced when corresponding text items are 
relatively dissimilar. The read display further includes 

is text window 1 36 for displaying a selected text item, such 
as text item 132. The corresponding icon 134 may be 
highlighted in the icon display window. A new text item 
may be selected by pointing to one of the icons and 
clicking the pointing device. The used may select similar 

20 text items by selecting icons that are clustered together 
and may select dissimilar text items by selecting icons 
that are widely spaced. 

[0031 ] Additional optional features may be included in 
the read display. For example, the icon display window 

25 116 may indicate the age of text items in the experience 
journal. In particular, the intensity of the icons may indi- 
cate the time from acceptance into the experience jour- 
nal. Newer text items are represented by relatively 
intense icons, whereas older text items are represented 

so by less intense, faded icons. An icon representative of a 
particular text item is initially intense following accept- 
ance and fades with time toward the background color. 
The icons never disappear entirely but instead reach a 
minimum intensity level. The aging of the display may 

as be accomplished by attaching a time stamp indicative of 
the time of acceptance to each text item. The intensity 
of the corresponding icon is based on a difference 
between the present time and the time of acceptance. It 
will be understood that display parameters other than 

40 intensity may be utilized to indicate the ages of the text 
Hems. 

[0032] In another example of an optional feature, the 
icon display window 1 1 6 may indicate a trail of text items 
selected by the user. The trail may be indicated, for 

45 example, by icons of a selected color, with successively 
reduced brightness indicating less recently accessed 
text items. The selected color is different from the color 
of text items that have not been selected by the user. 
[0033] An additional optional feature is the ability to 

50 magnify a portion of the icon display window 1 16. Mag- 
nification buttons 350, 352, 354, etc. may be provided at 
the top of the display screen. Different magnification 
buttons provide different magnification factors. When 
one of the magnification buttons 350, 352, 354, .etc. is 

55 selected, a magnified region 360 appears in icon win- 
dow 116. Within magnified region 360, the distances 
between icons are magnified to facilitate selection of a 
text item. The magnification feature is useful in the case 
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of clusters of icons which represent similar text items. 
The magnified region 360 may be moved in the icon 
window 116 using the pointing device. A button 370 is 
used to establish a view at normal scale without magni- 
fication. 

[0034] An additional optional feature is associated 
with icons 380, 382, 384, etc. positioned below icon win- 
dow 116. Icons 380, 382, 384 represent linked text 
items in the experience journal. The text items may be 
linked according to similarity or according to another cri- 
teria. By selecting one of the icons 380, 392, 384 etc., 
the corresponding linked text item is accessed and is 
displayed in text window 136. 
[0035] The experience journal icon display window 
has been described as including lilypad icons on a pond 
background. It will be understood that any suitable icons 
and any suitable background may be utilized within the 
scope of the present invention. Different backgrounds, 
such as for example a cityscape or a sunset, may be uti- 
lized as appropriate for a particular application of the 
experience journal. Furthermore, the buttons on the 
experience journal display may be changed to provide 
different options and functions as necessary for a par- 
ticular application. 

[0036] An example of a write display for adding text 
items to the experience journal is shown in FIG. 8. The 
user enters a narrative text, with few if any restrictions, 
using keyboard 20 (FIG; 1). After the text item is com- 
pleted to the satisfaction of the user, it can be submitted 
to the experience journal for review and editing by the 
review board by selecting button 400. This causes the 
new text item to be sent on the network to the control 
computer 30 (FIG. 2). Conventional text entry functions 
may be provided in the write display. In the example of 
FIG. 8, the write display incudes a new document button 
410, an open document button 412, a save button 414, 
a save copy button 416, a print button 418, a cut button 
420, a copy button 422 and a paste button 424. A button 
430 may provide access to an administrative display, an 
example of which is shown in FIG. 9 and described 
below. A password may be required for access to the 
administrative display. 

[0037] An example of an administrative display used 
by the screening committee in the control computer is 
shown in FIG. 9. An "included" window 500 shows a list 
of text items previously included in the experience jour- 
nal by title. A "submitted" window 502 indicates text 
items submitted by users for entry into the experience 
journal but not yet reviewed by the screening commit- 
tee. A "hold" window 504 lists text items that have been 
submitted but which have been placed on hold for fur- 
ther review and/or editing. A "text" window 510 displays 
a selected text item from one of the windows 500. 502 
or 504. The text item in window 510 may be edited by 
the screening committee to eliminate material that can 
be medically harmful or legally compromising to the 
hospital or other organization. When the text item is 
accepted by the screening committee, it may be 



included in the experience journal by selecting button 
520. Thus, button 520 causes a selected text item to be 
transferred from submitted window 502 to included win- 
dow 500. The processes described above for determin- 

5 ing similarity between the new text item and the other 
text items in the experience journal and for positioning 
an icon in the icon window are executed. A new icon 
corresponding to the newly entered text item then 
appears in the icon window. The new text item is then 

io available for selection by users in the read display. 
[0038] Additional function buttons on the administra- 
tive display include a button 522 for transferring a text 
item from hold window 504 to submitted window 502. A 
button 524 transfers a selected text item from submitted 

is window 502 to hold window 504. A button 530 selects a 
dialog box that may be used to access a different expe- 
rience journal or to create a new experience journal. 
Button 530 is used in connection with systems that may 
provide access to more than one experience journal. 

20 [0039] The experience journal has been described 
thus far in connection a medical application wherein the 
experience journal contains stories and other narratives 
relating to an illness. However, the experience journal 
may be utilized in any situation where a community 

25 wishes to share unstructured information relating to a 
particular topic or issue. The experience journal is par- 
ticularly useful where the entries are unrestricted as to 
style, content and form and where the experience jour- 
nal may be accessed by users having very limited com- 

30 puter skills. In some applications, the screening 
committee, which edits and accepts entries in the expe- 
rience journal, may be unnecessary. In this case, newly 
submitted text items are included directly in the experi- 
ence journal without editing or review. Likewise, in some 

35 applications, states of the experience journal may be 
fixed for periodic distribution. 
[0040] Having thus described certain embodiments of 
the present invention, various alterations, modifications, 
and improvements will readily occur to those skilled in 

40 the art. Such alterations, modifications, and improve- 
ments are intended to be within the spirit and scope of 
the invention. Accordingly, the foregoing description is 
by way of example only, and not intended to be limiting. 
The invention is limited only as defined in the following 

45 claims and the equivalents thereof. 

Claims 

1. A method for providing access to an experience 
so journal including unstructured text items relating to 
a topic in a computer system comprising a proces- 
sor, a storage device, a video display unit having a 
display screen, and a user interface, said computer 
system executing the steps comprising: 

55 

storing the unstructured text items in the stor- 
age device; 

determining similarity among the unstructured 
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text Items according to a criteria; 
displaying icons, one corresponding to each of 
the unstructured text items, on the display 
screen, including positioning said icons relative 
to each other such that the distances between s 
said icons are representative of the determined 
similarities among the unstructured text items; 
and 

in response to user selection of one of said 
icons, displaying the corresponding unstruc- 10 
tured text item on the display screen. 

2. A method for providing access to an experience 
journal as defined in claim 1, wherein the step of 
determining similarity comprises analyzing the is 
unstructured text items for use of the same words. 

3. A method for providing access to an experience 
journal as defined in claim 1, wherein the step of 
determining similarity comprises the steps of: so 

removing stop words from the unstructured text 
items; 

removing suffixes from words in the unstruc- 
tured text items; 25 
weighting remaining words in the unstructured 
text items according to the inverse of their 
occurrence frequency in a reference text to pro- 
vide a word vector for each of the text items; 
and so 
determining a similarity score for each pair of 
unstructured text items by determining the dot 
product of the corresponding word vectors. 

4. A method for providing access to an experience 35 
journal as defined in claim 3, wherein the step of 
positioning said icons comprises the steps of: 

embedding points corresponding to said text 
items in a multidimensional space such that <o 
pairs of points are spaced by distances repre- 
sentative of the similarity score for the corre- 
sponding pair of text Hems; 
projecting said points onto a two-dimensional 
space in which the distances between said 45 
points are preserved; and 
displaying said icons using their corresponding 
point positions in the two-dimensional space. 

5. A method for providing access to an experience so 
journal as defined in claim 1, wherein the step of 
displaying icons comprises the step of indicating 
relative ages of corresponding text items in the 
experience journal. 

55 

6. A method for providing access to an experience 
journal as defined in claim 5. wherein the step of 
indicating ages of text items comprises varying a 



parameter of the icon in accordance with the age of 
the corresponding text item. 

7. A method for providing aocess to an experience 
journal as defined in claim 1, wherein the step of 
positioning said icons comprises the step of multidi- 
mensional scaling of the determined similarities 
among the unstructured text items. 

8. A method for providing access to an experience 
journal as defined in claim 1 , wherein the step of 
displaying icons comprises the step of magnifying a 
selected region of the display screen. 

9. A method for providing aocess to an experience 
journal as defined in claim 1 , further comprising the 
step of identifying recently-accessed text items by 
marking the corresponding icons on the display 
screen. 

10. A method for providing access to an experience 
journal as defined in claim 1 , further comprising the 
steps of linking similar ones of said unstructured 
text items and indicating the linked text items by 
marking the corresponding icons on the display 
screen. 

11. A method for providing aocess to an experience 
journal as defined in claim 1 , further comprising the 
step of entering a new unstructured text item into 
the experience journal, comprising the steps of 
determining similarity between the new text item 
and other text items in the experience journal, dis- 
playing a new icon corresponding to the new text 
item on the display screen and positioning the new 
icon relative to other icons such that the distances 
between said icons are representative of the deter- 
mined similarities among the text items. 

12. A method for providing access to an experience 
journal as defined in claim 11, wherein the step of 
entering a new text item further comprises the step 
of entering an edited text item in response to inputs 
by a user. 

13. A method for providing access to an experience 
journal as defined in claim 1 , further comprising the 
step of making the experience journal available for 
access on a local area network. 

14. A method for providing access to an experience 
journal as defined in claim 1 , further comprising the 
step of making the experience journal available for 
access on the world wide web. 

15. A method for providing access to an experience 
journal as defined in claim 14, further {jomprising 
the step of receiving an unstructured text item for 
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entry into the experience journal on the world wide 
web. 

16. A method for providing access to an experience 
journal as defined in claim 1, wherein the step of s 
displaying icons comprises transmitting information 
representative of said icons and the relative posi- 
tions thereof to a remote user on the world wide 
web for display on the remote user's display screen 
and wherein the step of displaying the text item cor- 10 
responding to the selected icon comprises transmit- 
ting the corresponding text item to the remote user 

on the world wide web for display on the user's dis- 
play screen. 

75 

17. A method for providing access to an experience 
journal as defined in claim 1 , wherein said unstruc- 
tured text items relate to a medical condition. 

18. Apparatus for providing access to an experience 20 
journal including unstructured text items relating to 
atopic, said apparatus comprising: 



19. Apparatus for providing access to an experience 
journal as defined in claim 18 wherein said means 

for determining similarity comprises means for ana- 45 
lyzing the unstructured text items for use of the 
same words. 

20. Apparatus for providing access to an experience 
journal as defined in claim 18, wherein said means so 
for determining similarity comprises: 



occurrence frequency in a reference text to pro- 
vide a word vector for each of the text items; 
means for determining a similarity score for 
each pair of unstructured text items by deter- 
mining the dot product of the corresponding 
word vectors. 

21. Apparatus for providing access to an experience 
journal as defined in claim 20, wherein said means 
for positioning said icons comprises: 

means for embedding points corresponding to 
said text items in a multidimensional space 
such that pairs of points are spaced by dis- 
tances representative of the similarity score for 
the corresponding pair of text items; 
means for projecting said points onto a two- 
dimensional space in which the distances 
between said points are preserved; and 
means for displaying said icons using their cor- 
responding point positions in said two-dimen- 
sional space. 

22. Apparatus for providing access to an experience 
journal as defined in claim 18, wherein said means 
for displaying icons further comprises means for 
displaying said icons with an indication of the age of 
the corresponding text item in said experience jour- 
nal. 

23. Apparatus for providing access to an experience 
journal as defined in claim 18, wherein said means 
for positioning said icons comprises means for mul- 
tidimensional scaling of the determined similarities 
among the unstructured text items. 

24. Apparatus for providing access to an experience 
journal as defined in claim 18, further comprising 
means for identifying recently accessed text items 
by marking the corresponding icons on the display 
screen. 

25. Apparatus for providing access to an experience 
journal as defined in claim 18, further comprising 
means for linking similar ones of said unstructured 
text items and means for indicating the linked text 
items by marking the corresponding icons on the 
display screen. 



a computer system including a processor, a 
storage device, a video display unit having a 25 
display screen, and a user interface, said com- 
puter system further comprising: 
means for storing the unstructured text items in 
the storage device; 

means for determining similarity among the 30 
unstructured text items; 
means for displaying icons, one corresponding 
to each of the unstructured text items, on the 
display screen, including means for positioning 
said icons relative to each other such that the 35 
distances between said icons are representa- 
tive of the determined similarities among the 
unstructured text items; and 
means responsive to user selection of one of 
said icons for displaying the corresponding ao 
unstructured text Hern on the display screen. 



means for removing stop words from the 
unstructured text items; 

means for removing suffixes from words in the ss 
unstructured text items; 
weighting remaining words in the unstructured 
text items according to the inverse of their 
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